Technical supplement to “ Consistent probabilistic outputs for protein function prediction ”

نویسندگان

  • Guillaume Obozinski
  • Gert Lanckriet
  • Michael I. Jordan
چکیده

Protein function prediction, in the context of the Gene Ontology, is a task that consists of answering, for a fixed protein X, a large number of binary questions of the form: " Does protein X belong to GO term Y ? " Those binary classification problems are strongly related because the ontology consists of nested classes. Two natural requirements for this prediction problem are • that the set of predictions be consistent, i.e., that if a protein is assigned a GO term, then it is all also assigned all the ancester GO terms, and • that high-confidence predictions can be produced with a quantified confidence level. Methods of structured classification proposed in machine learning Taskar et al. [2003] could in theory be used to tackle this problem. However, two practical difficulties that need to be surmounted are the large amount of missing data and the large scale of the 1

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ارائه یک مدل احتمالاتی برای توزیع خوردگی یکنواخت در سکوهای ثابت فلزی در خلیج فارس

For structural reliability assessment or risk analysis of aging offshore steel structures, it is essential to have a probabilistic model, which contains specific statistical parameters, and predicts long term corrosion loss as a function of time. The aim of this study is to propose such model for offshore jacket platforms in the Persian Gulf. Field measurements for material loss due to uniform ...

متن کامل

Application of Linear Regression and Artificial NeuralNetwork for Broiler Chicken Growth Performance Prediction

This study was conducted to investigate the prediction of growth performance using linear regression and artificial neural network (ANN) in broiler chicken. Artificial neural networks (ANNs) are powerful tools for modeling systems in a wide range of applications. The ANN model with a back propagation algorithm successfully learned the relationship between the inputs of metabolizable energy (kca...

متن کامل

Fuzzy Integral Based Data Fusion for Protein Function Prediction

Data fusion using diverse biological data has been applied to predict the protein function in recent years. In this paper, fuzzy integral fusion based on fuzzy measure is used to integrate the probabilistic outputs of different classifiers. Support vector machines as base learners are applied to predict the functions of examples from each data source. Fuzzy density values are determined by Part...

متن کامل

Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches

DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...

متن کامل

On the use of back propagation and radial basis function neural networks in surface roughness prediction

Various artificial neural networks types are examined and compared for the prediction of surface roughness in manufacturing technology. The aim of the study is to evaluate different kinds of neural networks and observe their performance and applicability on the same problem. More specifically, feed-forward artificial neural networks are trained with three different back propagation algorithms, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008